Artificial Intelligence
Multimodal large language models (MLLMs) have shown remarkable progress in high-level semantic tasks such as visual question answering, image captioning, and emotion recognition. However, despite advancements, there remains a lack of standardized benchmarks for evaluating MLLMs performance in multi-object sentiment analysis, a key task in semantic understanding. To address this gap, we introduce MOSABench, a novel evaluation dataset designed specifically for multi-object sentiment analysis.
- Categories:
kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti
- Categories:
These datasets originate from two separate power transformers, identified as Transformer 1 and Transformer 2. Each dataset is further categorized into two distinct time intervals for data collection. The first time interval, labeled as "m," represents a short-term sampling period with data recorded every 15 minutes, capturing detailed temporal fluctuations over shorter durations. The second time interval, labeled as "h," signifies a longer-term sampling period with data recorded at hourly intervals, providing an overview of broader trends and patterns.
- Categories:
The Human voice Natural Language from On-demand media (HENLO) dataset is a high-quality emotional speech dataset created to address the need for representative and realistic data in speech emotion recognition research. Unlike many existing datasets, which rely on simulated emotions performed by untrained speakers or directed participants, HENLO sources its data from professionally produced films and podcasts available on Media On-Demand (MOD).
- Categories:
In this paper we use Natural Language Processing techniques to improve different machine learning approaches (Support Vector Machines (SVM), Local SVM, Random Forests) to the problem of automatic keyphrases extraction from scientific papers. For the evaluation we propose a large and high-quality dataset: 2000 ACM papers from the Computer Science domain. We evaluate by comparison with expert-assigned keyphrases.
- Categories:
Backlog refinement is a critical process within Agile practices which often faces challenges like ambiguous user stories, prioritization difficulties, and cognitive overload among team members. Teams spend a lot of time in grooming user stories and refining them based on the client or business requirements and customer feedback. In this paper, we present an empirical study, exploring the integration of Generative AI (GenAI), specifically Large Language Models into backlog refinement workflows to address these challenges.
- Categories:
Vision-language (VL) datasets are essential for advancing the capabilities of VL models, particularly in specialized domains like medical imaging. However, existing medical VL datasets are relatively small and predominantly focus on chest X-rays, limiting their applicability to other areas. To address this gap, we introduce the Skin-Path dataset, a comprehensive VL dataset specifically curated for histopathology.
- Categories:
Hyperspectral images are represented by numerous
narrow wavelength bands in the visible and near-infrared parts
of the electromagnetic spectrum. As hyperspectral imagery gains
traction for general computer vision tasks, there is an increased
need for large and comprehensive datasets for use as training
data.
Recent advancements in sensor technology allow us to capture
hyperspectral data cubes at higher spatial and temporal reso-
lution. However, there are few publicly available multi-purpose
- Categories:
The necessity for strong security measures to fend off cyberattacks has increased due to the growing use of Industrial Internet of Things (IIoT) technologies. This research introduces IoTForge Pro, a comprehensive security testbed designed to generate a diverse and extensive intrusion dataset for IIoT environments. The testbed simulates various IIoT scenarios, incorporating network topologies and communication protocols to create realistic attack vectors and normal traffic patterns.
- Categories: